Linear-quadratic control and information relaxations

نویسندگان

  • Martin B. Haugh
  • Andrew E. B. Lim
چکیده

We apply the recently developed duality methods based on information relaxations to the classic linear quadratic (LQ) control problem. We derive two dual optimal penalties for the LQ problem when the control space is unconstrained. These two penalties, which are derived using value function and gradient methods, respectively, may be used to evaluate sub-optimal policies for constrained LQ problems when it is not possible to determine the optimal policy exactly. We also compare these dual penalties to the dual penalty of Davis and Zervos (1994). This connection to the earlier work of Davis and Zervos is not widely known and demonstrates that some of these duality ideas have been in circulation for some time. We also emphasize that while the three penalties are dual optimal, they are not identical. Indeed their differences have significant implications when the penalties are used via Monte-Carlo to evaluate sub-optimal policies for constrained LQ problems. Our conclusions should apply more generally to other stochastic control problems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Convex relaxations for mixed integer predictive control

The main objective in this work is to compare different convex relaxations for Model Predictive Control (MPC) problems with mixed real valued and binary valued control signals. In the problem description considered, the objective function is quadratic, the dynamics are linear, and the inequality constraints on states and control signals are all linear. The relaxations are related theoretically ...

متن کامل

Linear Programming Relaxations of Quadratically Constrained Quadratic Programs

We investigate the use of linear programming tools for solving semidefinite programming relaxations of quadratically constrained quadratic problems. Classes of valid linear inequalities are presented, including sparse PSD cuts, and principal minors PSD cuts. Computational results based on instances from the literature are presented.

متن کامل

Semidefinite Programming Relaxations for the Quadratic Assignment Problem

Semideenite programming (SDP) relaxations for the quadratic assignment problem (QAP) are derived using the dual of the (homogenized) Lagrangian dual of appropriate equivalent representations of QAP. These relaxations result in the interesting, special, case where only the dual problem of the SDP relaxation has strict interior, i.e. the Slater constraint qualii-cation always fails for the primal...

متن کامل

From Linear to Semidefinite Programming: An Algorithm to Obtain Semidefinite Relaxations for Bivalent Quadratic Problems

In this paper, we present a simple algorithm to obtain mechanically SDP relaxations for any quadratic or linear program with bivalent variables, starting from an existing linear relaxation of the considered combinatorial problem. A significant advantage of our approach is that we obtain an improvement on the linear relaxation we start from. Moreover, we can take into account all the existing th...

متن کامل

Semidefinite Relaxations of Ordering Problems

Ordering problems assign weights to each ordering and ask to find an ordering of maximum weight. We consider problems where the cost function is either linear or quadratic. In the first case, there is a given profit if the element u is before v in the ordering. In the second case, the profit depends on whether u is before v and r is before s. The linear ordering problem is well studied, with ex...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Oper. Res. Lett.

دوره 40  شماره 

صفحات  -

تاریخ انتشار 2012